208 research outputs found
Evaluating the Usability of Automatically Generated Captions for People who are Deaf or Hard of Hearing
The accuracy of Automated Speech Recognition (ASR) technology has improved,
but it is still imperfect in many settings. Researchers who evaluate ASR
performance often focus on improving the Word Error Rate (WER) metric, but WER
has been found to have little correlation with human-subject performance on
many applications. We propose a new captioning-focused evaluation metric that
better predicts the impact of ASR recognition errors on the usability of
automatically generated captions for people who are Deaf or Hard of Hearing
(DHH). Through a user study with 30 DHH users, we compared our new metric with
the traditional WER metric on a caption usability evaluation task. In a
side-by-side comparison of pairs of ASR text output (with identical WER), the
texts preferred by our new metric were preferred by DHH participants. Further,
our metric had significantly higher correlation with DHH participants'
subjective scores on the usability of a caption, as compared to the correlation
between WER metric and participant subjective scores. This new metric could be
used to select ASR systems for captioning applications, and it may be a better
metric for ASR researchers to consider when optimizing ASR systems.Comment: 10 pages, 8 figures, published in ACM SIGACCESS Conference on
Computers and Accessibility (ASSETS '17
Multilingual Word Sense Induction to Improve Web Search Result Clustering
In [12] a novel approach to Web search result clustering based on Word Sense Induction, i.e. the automatic discovery of word senses from raw text was presented; key to the proposed approach is the idea of, first, automatically in- ducing senses for the target query and, second, clustering the search results based on their semantic similarity to the word senses induced. In [1] we proposed an innovative Word Sense Induction method based on multilingual data; key to our approach was the idea that a multilingual context representation, where the context of the words is expanded by considering its translations in different languages, may im- prove the WSI results; the experiments showed a clear per- formance gain. In this paper we give some preliminary ideas to exploit our multilingual Word Sense Induction method to Web search result clustering
Beyond Textual Issues: Understanding the Usage and Impact of GitHub Reactions
Recently, GitHub introduced a new social feature, named reactions, which are
"pictorial characters" similar to emoji symbols widely used nowadays in
text-based communications. Particularly, GitHub users can use a pre-defined set
of such symbols to react to issues and pull requests. However, little is known
about the real usage and impact of GitHub reactions. In this paper, we analyze
the reactions provided by developers to more than 2.5 million issues and 9.7
million issue comments, in order to answer an extensive list of nine research
questions about the usage and adoption of reactions. We show that reactions are
being increasingly used by open source developers. Moreover, we also found that
issues with reactions usually take more time to be handled and have longer
discussions.Comment: 10 page
Second-Order Belief Hidden Markov Models
Hidden Markov Models (HMMs) are learning methods for pattern recognition. The
probabilistic HMMs have been one of the most used techniques based on the
Bayesian model. First-order probabilistic HMMs were adapted to the theory of
belief functions such that Bayesian probabilities were replaced with mass
functions. In this paper, we present a second-order Hidden Markov Model using
belief functions. Previous works in belief HMMs have been focused on the
first-order HMMs. We extend them to the second-order model
The Relationship Between Plasma Flow Doppler Velocities and Magnetic Field Parameters During the Emergence of Active Regions at the Solar Photospheric Level
A statistical study has been carried out of the relationship between plasma
flow Doppler velocities and magnetic field parameters during the emergence of
active regions at the solar photospheric level with data acquired by the
Michelson Doppler Imager (MDI) onboard the Solar and Heliospheric Observatory
(SOHO). We have investigated 224 emerging active regions with different spatial
scales and positions on the solar disc. The following relationships for the
first hours of the emergence of active regions have been analysed: i) of peak
negative Doppler velocities with the position of the emerging active regions on
the solar disc; ii) of peak plasma upflow and downflow Doppler velocities with
the magnetic flux growth rate and magnetic field strength for the active
regions emerging near the solar disc centre (the vertical component of plasma
flows); iii) of peak positive and negative Doppler velocities with the magnetic
flux growth rate and magnetic field strength for the active regions emerging
near the limb (the horizontal component of plasma flows); iv) of the magnetic
flux growth rate with the density of emerging magnetic flux; v) of the Doppler
velocities and magnetic field parameters for the first hours of the appearance
of active regions with the total unsigned magnetic flux at the maximum of their
development.Comment: 14 pages, 8 figures. The results of article were presented at the
ESPM-13 (12-16 September 2011, Rhodes, Greece, Abstract Book p. 102-103,
P.4.13,
http://astro.academyofathens.gr/espm13/documents/ESPM13_abstract_programme_book.pdf
Structural Invariance of Sunspot Umbrae Over the Solar Cycle: 1993-2004
Measurements of maximum magnetic flux, minimum intensity, and size are
presented for 12 967 sunspot umbrae detected on the NASA/NSO
spectromagnetograms between 1993 and 2004 to study umbral structure and
strength during the solar cycle. The umbrae are selected using an automated
thresholding technique. Measured umbral intensities are first corrected for a
confirming observation of umbral limb-darkening. Log-normal fits to the
observed size distribution confirm that the size spectrum shape does not vary
with time. The intensity-magnetic flux relationship is found to be steady over
the solar cycle. The dependence of umbral size on the magnetic flux and minimum
intensity are also independent of cycle phase and give linear and quadratic
relations, respectively. While the large sample size does show a low amplitude
oscillation in the mean minimum intensity and maximum magnetic flux correlated
with the solar cycle, this can be explained in terms of variations in the mean
umbral size. These size variations, however, are small and do not substantiate
a meaningful change in the size spectrum of the umbrae generated by the Sun.
Thus, in contrast to previous reports, the observations suggest the equilibrium
structure, as testified by the invariant size-magnetic field relationship, as
well as the mean size (i.e. strength) of sunspot umbrae do not significantly
depend on solar cycle phase.Comment: 17 pages, 6 figures. Published in Solar Physic
Resolving the infinitude controversy
A simple inductive argument shows natural languages to have infinitly many sentences, but workers in the field have uncovered clear evidence of a diverse group of âexceptionalâ languages from Proto-Uralic to Dyirbal and most recently, PirahĂŁ, that appear to lack recursive devices entirely. We argue that in an information-theoretic setting non-recursive natural languages appear neither exceptional nor functionally inferior to the recursive majority
Higher dietary flavone, flavonol, and catechin intakes are associated with less of an increase in BMI over time in women: a longitudinal analysis from the Netherlands Cohort Study
BACKGROUND: Dietary flavonoids are suggested to have antiobesity effects. Prospective evidence of an association between flavonoids and body mass index (BMI) is lacking in general populations. OBJECTIVE: We assessed this association between 3 flavonoid subgroups and BMI over a 14-y period in 4280 men and women aged 55-69 y at baseline from the Netherlands Cohort Study. DESIGN: Dietary intake was estimated at baseline (1986) by a validated food-frequency questionnaire. BMI was ascertained through self-reported height (in 1986) and weight (in 1986, 1992, and 2000). Analyses were based on sex-specific quintiles for the total intake of 6 catechins and of 3 flavonols/flavones. Linear mixed effect modeling was used to assess longitudinal associations in 3 adjusted models: age only, lifestyle (age, energy intake, physical activity, smoking status, alcohol intake, type 2 diabetes, and coffee consumption), and lifestyle and diet (vegetables, fruit, fiber, grains, sugar, dessert, and dieting habits). RESULTS: After adjustment for age and confounders, the BMI (kg/m(2)) of women with the lowest intake of total flavonols/flavones and total catechins increased by 0.95 and 0.77, respectively, after 14 y. Women with the highest intake of total flavonols/flavones and total catechins experienced a significantly lower increase in BMI of 0.40 and 0.31, respectively (between group difference: P < 0.05). This difference remained after additional adjustment for dietary determinants and after stratification of median baseline BMI. In men, no significant differences in BMI change were observed over the quintiles of flavonoid intake after 14 y. CONCLUSION: Our results suggest that flavonoid intake may contribute to maintaining body weight in the general female population. AD - .s FAU - Hughes, Laura A E AU - CN - Netherlands Cohort Study LA - eng PT - Journal Article PT - Research Support, Non-U.S. Gov't PL - United States TA - Am J Clin Nutr JT - The American journal of clinical nutrition JID - 0376027 SB - AIM SB - I
Positive words carry less information than negative words
We show that the frequency of word use is not only determined by the word
length \cite{Zipf1935} and the average information content
\cite{Piantadosi2011}, but also by its emotional content. We have analyzed
three established lexica of affective word usage in English, German, and
Spanish, to verify that these lexica have a neutral, unbiased, emotional
content. Taking into account the frequency of word usage, we find that words
with a positive emotional content are more frequently used. This lends support
to Pollyanna hypothesis \cite{Boucher1969} that there should be a positive bias
in human expression. We also find that negative words contain more information
than positive words, as the informativeness of a word increases uniformly with
its valence decrease. Our findings support earlier conjectures about (i) the
relation between word frequency and information content, and (ii) the impact of
positive emotions on communication and social links.Comment: 16 pages, 3 figures, 3 table
- âŠ